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Summary: Joint models are used in ageing studies to investigate the association between longitudinal markers and a 
time-to-event, and have been extended to multiple markers and/or competing risks. The competing risk of death must 
be considered in the elderly because death and dementia have common risk factors. Moreover, in cohort studies, time- 
to-dementia is interval-censored because dementia is only assessed intermittently. So subjects can become demented 
and die between two follow-up visits without being diagnosed. To study pre-dementia cognitive decline, we propose 
a joint latent class model combining a (possibly multivariate) mixed model and an illness-death model handling 
both interval censoring (by accounting for a possible unobserved transition to dementia) and semi-competing risks. 
Parameters are estimated by maximum likelihood handling interval censoring. The correlation between the marker 
and the times-to-events is captured by latent classes, homogeneous groups with specific risks of death and dementia 
and profiles of cognitive decline. We propose markovian and semi-markovian versions. Both approaches are compared 
to a joint latent class model for standard competing risks through a simulation study, and then applied in a prospective 
cohort study of cerebral and functional ageing to distinguish different profiles of cognitive decline associated with risks 
of dementia and death. The comparison highlights that among demented subjects, mortality depends more on age 
than duration of dementia. This model distinguishes the so-called terminal pre-death decline (among non-demented 
subjects) from the pre-dementia decline. 
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1. Introduction 

Joint models are becoming increasingly popular as they allow an analysis of the association 


between the risk of an event and the change over time of a longitudinal marker (Tsiatis and 


Davidian, 2004 Rizopoulos, 2012). In a cognitive ageing study, the link between cognitive 


decline and dementia needs to be understood to better describe the course of the disease in 
the pre-diagnostic stage, and to develop prediction tools for the risk of dementia. Moreover, 
modeling the evolution of cognitive markers without modeling jointly the risk of dementia 
may lead to biased estimations of the change over time of the marker as collection of cognitive 
measures is often stopped after dementia onset, inducing non-random missing data. Joint 
models correct for this bias by accounting for the association between the marker and the 
time-to-event. 


Wulfsohn and Tsiatis (1997) proposed shared random effects models where a function of the 


random effects from the longitudinal model is included in the survival model, thus capturing 
the correlation between the time-to-event and the marker. The risk of the event is then 
partly explained by the individual dynamics of the marker trajectory. An alternative is the 


joint latent class mixed model, developed by Lin, Turnbull and Slate (2002) which considers 


a heterogeneous population, divisible into several homogeneous latent subgroups, with a 
specific risk of the event and a specific evolution of the marker. A significant computational 
advantage is the replacement of the integrals on the random effects by a sum over the classes, 
such that the likelihood of this model has a closed form. 

When studying the risk factors or natural history of Alzheimer’s disease, the most frequent 
cause of dementia in the elderly, it is important to account for the competing risk of death 
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et al. (2008) proposed a shared random effects model accounting for competing risks and 


Proust-Lima and Jacqmin-Gadda (2015, submitted) developed a joint latent class model for 


competing risks to study multiple longitudinal markers of different natures. However, studies 
of the risk of dementia are made more difficult by the interval censoring of time-to-dementia. 
Indeed, in cohort studies, patients are observed intermittently and the age at dementia onset 
is not precisely known as dementia can only be diagnosed at clinical follow-up visits. As such, 
there is an interval of uncertainty between the last visit where the patient has been seen to be 
healthy and the visit where a diagnosis was made. More importantly, a patient can become 
demented and die between two visits without being diagnosed as demented. Consequently, 
the risk of dementia may be underestimated when interval censoring is not accounted for, 
for example when considering only the first observed event in the standard competing risks 


model. Joly et al. (2002) proposed an illness-death model to fix this issue but this has not 


yet been implemented in a joint model. Death and dementia are semi-competing events since 
dementia can not occur after death but death occurs after dementia. 


To our knowledge, only one joint model combining a multi-state model and a mixed model 
for a longitudinal marker has previously been proposed. Within the framework of the shared 
random effects approach, Dantan et al. (2011) described a joint model combining a two-phase 
mixed model with a random change-point and a multi-state model. The underlying clinical 
idea was an acceleration of the cognitive decline before dementia onset, which was modelled 
by a second phase with a different slope in the mixed model. In this model, interval censoring 
was not a critical issue as death depended on the current value of the marker and not on the 
current state. 


In this work, we propose a joint latent class illness-death model for semi-competing interval- 
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censored events and a longitudinal marker. We propose two versions of the model, a rnarko- 
vian and a semi-markovian version, and an extension for the joint analysis of multiple 
longitudinal markers. In the following section, we detail the model and the estimation 
procedure. In section 3, we present a simulation study to evaluate the estimation procedure 
and compare the proposed approach with a joint latent class model for competing risks 
that does not take into account interval censoring. In section 4, the model is applied to 
study cognitive decline before Alzheimer’s disease diagnosis and death using data from the 
French Paquid cohort, including 3,777 subjects followed over 20 years with regular cognitive 
evaluation. 

2. Methods 

2.1 Notations 

Let Yij denote the score of the psychometric test for subject i, for i — 1,..., N, at time t VJ , for 
j = 1, ..., n t . We denote by TV 4 the age at dementia onset and Tf the age at death. We assume 
that age at dementia onset is interval-censored while age at death is only right-censored as 
generally, exact ages of death are collected in cohort studies. Thus, A denotes the vector 
of collected variables about the event: A = (A, A, Sf, T t , 4, D ) T where T 0i is the age at 
inclusion, A is the age at the last visit where the subject has been seen to be healthy, Ri 
is the age at the visit of diagnosis if the subject is diagnosed with dementia (Ri = +oo if 
not diagnosed), R is the age at death or at the end of the follow-up, Sf is the indicator 
of Alzheimer’s diagnosis or dementia (8f — 1 if R, t ^ T t and 0 otherwise) and S[ J is the 
indicator of death (Sf = 1 if Tf = and 0 otherwise). 

2.2 Joint latent class illness-death model 

The model relies on the hypothesis that the population is heterogeneous and can be divided 
into G homogeneous latent classes. Each class has specific transition intensities for dementia 
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and death and a specific marker trajectory, as displayed in Figure [lj A central assumption 
states that the marker and the times-to-events are independent conditionally on the classes 
and covariates. 


[Figure 1 about here.] 


We first describe the probability 7p g , for subject i, of belonging to class g, for g — 1,..., G, 
with a multinomial logistic model: 


p( \ ex P( X pi Cg) 
n ig = P(a = g) = 


Em= i ex P( X pi C. 


( 1 ) 


The latent class membership variable c t is C; = g if subject i belongs to class g and X pi is 
a vector of covariates. We choose class G as the reference class so that (g = 0 to ensure 
identifiability. 


We denote by A(-) the latent process which stands for the true cognitive level. The 
conditional distribution of A(f) given the latent class is defined by a standard linear mixed 
model, without residual error, with class-specific parameters: 

= Xfig + Uig, (2) 

where X l3 is a vector of covariates of subject i at time t t j , /3 g is the vector of class-specific 
regression parameters and Z ig is a sub-vector of X ig . The random effects Ui g rv.; JV(0 ,a]B) 
have proportional variance matrices over the classes, with <Jq — 1. We denote by U the 
Cholesky transformation of the matrix B , which is a lower triangular matrix satisfying 
UU T = B. The marker Y tJ is considered as a measure with error of the latent process 
Aat time t i3 : 


Yij = A iitij) + €ij with eij ~ W(0, <jg). 


(3) 
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The transition intensities of dementia and death are modelled simultaneously using an 
illness-death model with three class-specific transition intensities (Figure [I]). In this way, we 
can distinguish the transition intensities of death for healthy and demented subjects. We 
propose both a markovian and a semi-markovian model. 


2.3 Markovian model 

Given the latent class g, the transition intensity from state k to state l depends on age t and 
it is modelled by a proportional hazards model with class-specific parameters: 

akiigit) = a° klg (t ) exp(Wj[ li Tkig), (4) 

where a klg is the baseline transition intensity, W kll is a vector of covariates and 'fkig are class- 
specific regression parameters. In the following paragraphs, we will refer to the cumulative 
transition intensities using the notation: A k i g (t ) = / a k i g {s)ds. 

Jo 


2.4 Semi-markovian model 

Alternatively, the transition intensity of death among demented subjects may depend on the 
time spent in the dementia state instead on age, leading to a semi-markovian illness-death 
model: 

«12 ig(t, T A ) = a 12 igit - T A ) = a° 12g (t - T A ) exp(W7 2i 7i2 g ), (5) 

where T A is the age at dementia onset so t — T A is the time spent in the dementia state. 


2.5 Log-likelihood of the markovian model 

Let 6g denote the vector including the regression, variance and baseline transition intensities 
parameters. The contribution £, of any subject i to the global log-likelihood C(9g) is the 
weighted sum of his or her contributions over the G classes. According to the conditional 
independence assumption, the individual conditional contribution to the likelihood given the 
class is the product of the conditional contributions of the mixed model and of the multi-state 
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model, as follows: 


N N G 

C{9g) = ^2Ci = J^log l^FigfiYild = g-e G )P(D i \c i = g-,0 G )] 

i= 1 i= 1 g=l 


N G 




g— Aa\i g (Toi-fiG) — Ao2ig(Toi\8 g) 


( 6 ) 


i= 1 3=1 

where f(Yi\ci = g\ 0 G ) is a multivariate gaussian density with mean E ig = Xj/3 g and variance 
matrix Vi g = a g ZiBZj + a^I and X t and Zi are design matrices with row vectors Xj g and 
Zj-. Then, P(Di\ci = g\ d G ) is detailed below for each possible observation pattern for death 
and dementia (as illustrated in Figure [2]). The second part accounts for the delayed entry, 
representing the probability of being alive and healthy at entry, which is the condition for 
inclusion in the sample. 


[Figure 2 about here.] 

• Subject diagnosed as demented (cases 1 and 2, Figure [2]): 


PfgiToi, Li, Ri, 1, Tj, 8 ![; 9 G ) — 


R% 


3 A ° li9(u) e Aa2ia{u) a mig {u) e ^ A ^ T i) A i 2 ig (u)) <\ u 


JLi 

The subject remained healthy and alive until age u between Li and Ri, became demented at 
u, remained alive until and possibly died at T t (if 5® = 1). 


• Subject observed healthy at the end of the follow-up (cases 3 and 4, Figure [2]): 


Pi h g (T 0i , L u Ri, 0, Ti, 5°- e G ) = Q- A oiig( T i)-Ao2i 3 (Ti) ao2ig ( T .f 

The subject remained healthy and alive until R and possibly died at T % . Case 4 corresponds 
to subjects healthy at the last visit and with no information on vital status after this visit. 
Case 3 is not observed in the Paquid study because subjects never die the very day of the 
visit. Consequently, we can never be totally sure that a subject who died was free of dementia. 
Nevertheless, this case may be observed for other pathologies. 


Subject with unknown dementia status at the end of the follow-up (cases 5 and 6, Figure 


Joint latent class model for longitudinal data and interval-censored semi-competing events 


7 


Pig(Toi, U, Ri, o ,Ti, 5 f; 0 G ) = e -*>u a m)-A oaig (T i ) a 02ig (Ti) s ? 

rTi 

+ / Q- A <nig{u)-A 0 2 ig{.u) aQlig (u) e -( A i 2 ig(Ti)-A 12 i g {u)) ai 2 ig (Ti) S ^du 

Jhi 

The numerator of Pf g accounts for the two possible trajectories: either the subject remained 
healthy until the end of the follow-up Tj (and possibly died), or he/she became demented 
between the last visit L, : , where he/she was observed healthy, and 7j. If so, the subject 
remained healthy and alive until age u between L* and T % , became demented at u, remained 
alive until T t and was lost to follow-up or died at T { . 


2.6 Likelihood of the semi-markovian model 

In the case of a semi-markovian model, the individual conditional contribution to the like¬ 
lihood of the multi-state model is changed for the observation patterns 1, 2, 5 and 6 as 
follows: 


Subject diagnosed as demented (cases 1 and 2, Figure [2]): 


cRi 


P d (T 0i , Li, 1 , T i: 6°; 0 G ) = I e~ A ^~ A ^ a 01ig (u ) e~ A ^ T ^ a 12ig (Ti - u) 5 ? du 


Subject with unknown dementia status at the end of follow-up T) (cases 5 and 6, Figure 


P? g (T 0i , Li, Ri, 0, Ti, 5 f; 0 G ) = ao2ig ( Ti ) S ? 

i - A 01i g {u)-Ao2ig(u) amig (^ Q-Al 2 ig(Ti-u) _ U ) 5 F d.U 

This model assumes that the transition intensity of death among demented subjects 
depends on the duration of dementia and not on age. 


+ fe 


2.7 Optimisation algorithm 

The maximum likelihood estimators are obtained for a fixed number of classes G by a 


Newton-Raphson-like algorithm (Marquardt, 1963). If necessary, at each iteration p, the 
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Hessian matrix H ( - p> is diagonal-inflated to obtain a positive definite matrix H*( p \ The vector 
of parameters is then updated by 6q +1) = 6]q — n f/(6 l ^) T [i/*^] _1 f/(^' > ) with 11(6^) the 
gradient at iteration p and n the improvement control parameter, optimized using a line 
search strategy. The convergence criteria are reached when the change in the likelihood, the 
change in the estimates between two iterations and the criterion U(6q^) t [H^ p ' , ]~ 1 U(6q' > ) are 
less than 10 -3 , 10“ 3 and 1CF 2 respectively. The variances of the estimates are obtained with 
the inverse of H^ p \ For each value of G, the estimation process is repeated with different 
initial values to insure convergence. Finally, the number of classes G is chosen by minimis¬ 


ing the Bayesian Information Criterion (BIC) (Schwartz, 1978), which favors parsimonious 


models and is recommended for mixture models (Hawkins, Allen and Stromberg, 2001). 


2.8 Extension to multiple non-gaussian markers 


The model can be extended to the analysis of K non-gaussian markers as in Proust-Lima et 


al. (2009). In psychometrics, quantitative tests are frequent, with asymmetric distributions 


and ceiling or floor effects. Moreover, as these tests are highly correlated, they may be 
considered as measures with error of a common latent process that stands for the true latent 
cognitive level underlying the various tests. A parametric monotonic function \l !k(',Vk), such 
as a Beta cumulative distribution function or a spline function, with r] k a vector of test- 
specific parameters, can then be used to model the link between the observed markers 17 /k 
and the latent process A(tij k ): 


^k(Y ijk ; r ] k ) = A i(t ijk ) + X k J /3 k + e ijk with e ijk ~ J\f( 0, a 2 J (7) 

Note that for more flexibility, the model ([7| may also include marker-specific effects (3 k of 
some covariates X k -. Consequently, we can define transformed scores Y t j k for test k and 
subject i at time t l]kl for i = 1 = 1 ,...,7i ik ,k = on the scale of the latent 

process: 


1 ijk ^ k^X^ijki Fjk) 


( 8 ) 
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The log-likelihood is then defined by: 

N G K n ik 

C(9g) = E 1o § ( E ^9%( Y i\ci = 0 ; 0g) [ n n %))] P{Di\a = g-, 0 G )) 


i =1 9 =1 

N G 


k =1 j'=l 


JV A' n ife 


= E'°s[E = 9 ; « G ) p ( d , ic=9; o G )] + E E E log 

2=1 #=1 2=1 k= 1 J = 1 

where J is the Jacobian, $ g (Yj|cj = g',9 G ) is a multivariate gaussian density with mean 
E ig = (Ej lg , ...,Ej Kg ) T and the elements of E ikg are E ijkg = Xj(t ijk ) /3 g + /3 k , with 

Xj (■ tij k ) is the vector of covariate values included in X t at time t igk , and variance matrix: 


14: = 



'zr 




2 T 

<4 1 

0 

0 

= 


a gB 1 

(^ 1T - 

zD + 

0 


0 


k z G 




v 0 

0 

2 r 

< - r eK^ n iK ) 


2.9 Goodness-of-fit 

Once the parameters are estimated, we can compute the posterior probability of belonging 
to class g, 


P(c i = g\Y i ,D i -§ G ) = 


n ig f(Yj\ci = g\ 0 G ) P(Di\cj = g~, 0 G ) 
Em =1 Kimf(Yi\ci = fn\ 9a) P(Di\ci = m; § G ) 


(9) 


and subjects are assigned to the class with the highest probability. First, we propose to assess 
the goodness-of-fit of the longitudinal predictions, conditional on the classes. To do so, we 
split the timescale into five-year age groups [r q ,r q+ 1 ]. For each class g, we then compare 
the class-specific predicted mean evolution of the marker, weighted by the posterior class 
membership probability 


I- 1 99 


E 


(i,j)\r q <tij<T q+ i 


E(Yij\ci = g] 9 g ) P{a = g\Yi, D t ; 9 G ) 


E 


(i,j)\r q <tij<r q+ i 


P(c i = g\Y i ,D i -9 G ) 


to the observed mean evolution weighted by the same posterior probability: 

E(i,j)|T,<tij<r 9+ i Yij P(a = g\Yi, Di ; 9 G ) 


kgg 


E 


(i,j)\r q <tij<Tg +1 


P{c i = g\Y i ,D i] 9 G ) 
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The assessment can also be done conditionally on the random effects, comparing jl° gq to 


with u ig 


V gq = 


E(ui\Yi , 


i,j)\T q <tij<T q+1 E{Yij\ci — g, Uig ; §g) P{(k — g\Yi, Di ; §q) 
Yl(i,j)\T q <t i:i <T q+ i F(ci = d\Yu Di] §g) 

Ci = g] 0 g ) the bayesian estimates of the random effects given the class 


g- 


Secondly, to evaluate the goodness-of-fit of the parametric illness-death predictions condi¬ 
tionally on the classes, we compare the predicted class-specific cumulative incidences of the 
three transitions to the class-specific predictions obtained by a semi-parametric illness-death 


model (Touraine, Helmer and Joly, 2013). Each transition intensity of this model is modelled 
by a proportional hazards model with baseline transition intensities modelled by M-splines 
and estimated by penalized likelihood. The contribution to the likelihood of any subject i is 
weighted by the individual posterior probability P(q = g\Y i} D^Og) obtained by the joint 
latent class illness-death model. Note that the cumulative incidences for transitions 0-1 and 
0-2 are estimated given that the subject is alive and healthy at age 65, and the cumulative 
incidence for transition 1-2 is estimated given that the demented subject is alive at age 65, 
as follows: 


Foig{t) 


Fl2 g{t) 


\ Q-A 0 l g (u)-A 02 g {u) aolg ^d U 
'65 ; 


,-Uoi ff (65)-A 02a (65) 


ft 


'65 


3 Ai2s(m ) ai2g(u)du 


3— Ai2 3 (65) 


1,2 


3. Simulations 

3.1 Design 

We carried out simulations in order to evaluate the estimation procedure and compare the 
estimations with those obtained using a joint latent class model for competing events without 
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accounting for interval censoring. 

Data were generated with a model with 2 latent classes with probability = 0.5 (£i = 0) 
for the first class. Of note, 7Ti represents the probability of belonging to the first class in the 
general population. As we only include in the cohort subjects who were healthy and alive 
at the first visit, we introduce a selection bias. The proportion of each class in the selected 
sample may then be different. 

For each subject, age at entry is generated from a uniform distribution on [65,85]. Age at 
dementia onset and age at death (for demented and non-demented subjects) are generated 
from Weibull distributions with class-specific parameters (shape parameter and scale 
parameter A^ for transition intensity from state k to state l in class g). The transition 
intensities account for a common effect of a binary covariate X, also generated from a binomial 
distribution with parameter 0.5: 

<Xkiig(t) = Ajg * lklXl k = 0,1; 1 = 1, 2; g = 1, 2 (10) 

The scores of the psychometric test are generated by a linear mixed model including fixed 
and random effects on the intercept and the slope, with an adjustment for the covariate X, 
common over the classes: 

Yij = /3 0g + p lg tij + ufj + uQ tij + Px X. u u ig ~ J\f( 0 , a 2 g B) ( 11 ) 

Time is a linear transformation of age: t = ag ^~ 65 . No parametric transformation T is used 
so the generated scores are considered as the observed ones. Two designs of follow-up were 
generated: the follow-up visits were scheduled either every 2 or 4 years from inclusion to the 
minimum between the visit following dementia onset, death or the administrative right- 
censoring which is 20 years after inclusion. For each design, we generated 500 samples 
of 500 subjects. The simulated parameters were similar to the ones obtained by a joint 
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markovian illness-death model with two latent classes on the Paquid dataset, without any 
linear transformation. 

We define the age at diagnosis as the age at the first visit following dementia onset if the 
generated age at death is old enough. Subjects who die before the next visit are considered 
as censored for dementia at the last visit before dementia onset. 

On average, over the 500 samples simulated within the ’’visits every two years” framework, 
23% of the subjects were observed demented, 17.7% died after dementia diagnosis and 62% 
died without dementia diagnosis. An average of 33.8% were allocated to the first class, of 
which 33.8% were observed demented, 65.3% died with no dementia diagnosis and 30.9% 
died after dementia diagnosis. In the second class, 17.9% were seen demented, 60.5% died 
with no dementia diagnosis and 10.9% died after dementia diagnosis. 

We also estimated a joint latent class model for competing risks, which does not account for 
interval censoring, on the same simulated data. In the standard competing risks framework, 
the outcome is a couple (T, 5) with T the time to the first event or the censoring time 
and 5 the indicator of the cause of the first event (5 = 1 if demented, 5 = 2 if dead and 
5 = 0 if censored). The estimation of this model on interval-censored data requires to impute 
this couple (T, 5) for some subjects. If the subject is observed demented before death, the 
recorded transition is the health-dementia transition at age of diagnosis. If the subject dies 
before the dementia diagnosis, we consider that the health-death transition occurs at age of 
death and time-to-dementia is censored at age of death. 
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3.2 Results 

Table [I] displays the results of the simulation study for (a) visits every 2 years, (b) visits 
every 4 years, for the joint latent class model for interval-censored semi-competing events on 
the left and for the joint latent class model for standard competing risks on the right. 


[Table 1 about here.] 

The top left part of Table [I] shows small biases and good coverage rates of the 95% 
confidence interval for the 25 parameters except for the two scale parameters of the dementia- 
death transition, A 121 and Aj 22 , which have lower coverage rates because their standard 
errors are under-estimated. This may be due to the small number of observed transitions 
from dementia to death, in these 500-subject samples. Indeed, the simulations made on 1000 
subjects and presented in Web Table 1 show better coverage rates. 

The simulations were also carried out with visit intervals of four years. We can see on 
the bottom left part of Table [l] that the parameters are still well estimated but have higher 
variances, especially for the illness-death parameter estimators, as the number of unobserved 
transitions increases when the censoring interval gets bigger. 

When compared with the competing risks estimations on the top right part of Table [lj we 
observe higher biases for the shape parameters for the transition toward dementia in both 
classes, Agn and A^, and toward death A| )21 and an under-estimation of the standard errors 
of the four parameters for the transition toward dementia. These trends are more pronounced 
for the 4-year-visit-interval data (see part b in Table [I]), leading to poor coverage rates that 
worsen further when the sample size increases due to smaller standard errors (see Web Table 
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1 for N=1000). 

We also assessed the semi-markovian model, with visit intervals of two and four years. The 
estimates have small biases and good coverage rates in the longitudinal and the illness-death 
parts (see Web Table 2). 


4. Application 

The joint latent class illness-death model was applied to a French prospective cohort, the 
Paquid cohort, to distinguish different profiles of cognitive decline in the elderly associated 
with the transition intensity of Alzheimer’s disease and death. We compared markovian 
and semi-markovian models, in order to determine whether the transition intensity of death 
among demented subjects depended more on age than on duration of dementia. 


4.1 Data 


The Paquid cohort (Letenneur et ah, 1994) involves 3,777 subjects from two French admin¬ 


istrative departments, Dordogne and Gironde. The subjects were 65 years old or older at 
entry and they were visited every 2 or 3 years at home to undergo a battery of psychometric 
tests. The diagnosis of dementia was based on a two-phase screening procedure according to 


DSM III R criteria for dementia (American Psychiatric Association, 1987). In this work, we 


focused on the ISAACS set test, scored from 0 to 40, assessing verbal fluency. Subjects had 
to produce up to 10 words from four different semantic categories within 15 seconds for each 
category. 


We selected subjects who were healthy at inclusion and who completed at least one ISAACS 
set test (until their diagnosis for subjects diagnosed as demented). Among the 3,777 subjects 
of the initial sample, 102 were excluded because they were prevalent cases at the first visit 
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and 150 were excluded because they had completed no tests during the follow-up. Finally, the 
sample under study involved 3,525 subjects, including 57.8% of women and 65.9% of subjects 
with a low level of education. A total of 23.8% of subjects were diagnosed as demented, 
including 19.8% who died during the follow-up, and 65.1% who died before the dementia 
diagnosis. 


4.2 Comparison of models 

The joint markovian model and the joint semi-markovian model were estimated and com¬ 
pared on the Paquid cohort. In both models, the sub-model for the longitudinal marker 
assumed a quadratic trend with three class-specific fixed and random effects of time and 
common effects for gender (Sex = 0 for men, 1 for women) and educational level (Educ = 
1 for subjects who obtained their primary school diploma, 0 for others). Given class g , the 
mixed model was defined by: 


t 2 


A i(t) —f3o g + Pig t + @ 2 g + Pz EduCi + Pi EduCi x t 

1 2 ^2 
+ p 5 Edud x — + p e Sexi + ufj + u^t + ufj —, 


tg *9 

where u ig = (u\®\ u[^\ uf^) T ~ Af (0, cr'yB). Time is a linear transformation of age : t = 
a9e i ~ 65 . The interaction between time and gender was not accounted for because it appeared 
insignificant in previous analyses. A Beta cumulative distribution function was used to link 
the observed scores to the latent process: 

77 (1) , 77 (2) , 77 (3) , 77 (4) ) = B ^ l3,n , I - = + e . jt ( 12 ) 


Y = 

1 V 


V 


(4) 


where e^- ~ Af(0,a^). The models for the transition intensities were proportional hazards 
models with common effects of the two covariates and class-specific Weibull baseline transi¬ 
tion intensities, as functions of age. We compared the markovian model defined by: 


Otklig(t) = oP k i g (t) exp ( 7 ^ Se^ + 7 ^ Educj), k e [0,1], 1 G [1, 2], 
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and the semi-markovian model where the third transition intensity had the following form: 

«12 ig(t, T A ) = a° 12g (t - T A ) exp( 7 ^ Sex ; + Educ ; ), 
where T A is the age at dementia onset. 

[Table 2 about here.] 

Table [2] shows that the markovian model fits the data better, with smaller BIC values 
irrespective of the number of classes. Thus, the transition intensity of death among demented 
subjects depends more on age than on dementia duration. 

The minimum value of BIC was obtained with G = 4 classes within the markovian frame¬ 
work. Thus, this model will be detailed below. Note that a higher degree of heterogeneity 
is expected within the semi-markovian framework, since the minimum value of BIC was 
obtained with five classes, but the fit was not as good as the four-class markovian model. 

4.3 Results 

Figure [3] represents the three transition intensities (Part A), the three cumulative incidences 
(Part B) and the estimated mean ISAACS trajectories (Part C) for men with a low level of 
education (with no primary school diploma), in each class. The Erst class includes 7.3% of the 
sample and will be denoted as the ‘healthy class’ as it has the lowest transition intensities of 
dementia and death until advanced ages, as well as the slightest mean decline of the ISAACS 
set test score (Figure [3] C). The cumulative incidences show that most dementia and death 
occurrences arise after age 85 in this class (Figure [3] B). At the other extreme, the second 
class, including 8 . 0 % of the population, will be denoted the ‘high transition intensities class’ 
since it has the highest transition intensities of dementia and death for healthy people and 
the fastest and deepest decline of ISAACS scores. As shown in Figure [3] B, most of these 
subjects are demented or dead before age 80. The third class, accounting for 34.2% of the 
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population, also has a high transition intensity of dementia and a steep cognitive decline, 
but these occur at later ages than in the second class, and a medium transition intensity 
of death for healthy people. The cumulative incidences show that about 60% of this group 
die without dementia before age 85 and 30% become demented before age 90. Finally, the 
fourth class includes 50.5% of the population and is quite similar to the first one, with more 
pronounced cognitive decline and higher transition intensities of death among both healthy 
and demented subjects. About 80% die without dementia and 20% become demented before 
age 95 in this class whereas these figures are only reached 5 to 10 years later in the first class. 


[Figure 3 about here.] 

Estimates of the main parameters of the retained model are presented in Web Table 3. 
Educational level is associated with lower transition intensities of dementia and death among 
healthy people but not with the transition intensity of death among demented people, after 
adjusting for the heterogeneity of the transition intensities due to the classes. Gender is not 
associated with the transition intensity of dementia but healthy and demented females have 
a lower transition intensity of death. Note that these estimates do not represent the global 
effect of gender and education on the population since the proportion of women and educated 
subjects is quite different between classes (see Web Table 4). 


4.4 Goodness-of-fit 


As loss of follow-up may be linked to a change in the cognitive test (Jacqmin-Gadda et al. 


1997), missing data are not missing completely at random; so we assessed the goodness- 


of-fit conditionally on random effects. Web Figure 1 displays the predicted weighted mean 
of ISAACS scores given the random effects and the classes, and the weighted mean of the 
observed scores for each class. The predicted mean is close to the observed mean and is 


within the confidence interval which increases over time as there is less data. The estimated 
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class-specific cumulative incidences averaged on the covariates on part B of Web Figure 1 are 
compared to the estimations, also averaged on the covariates, obtained by a semi-parametric 
illness-death model with baseline transition intensities modelled by M-splines, estimated with 
a weighted penalized likelihood. Here again, the estimations are close and the graphs show 
that the model fits the data well. 

4.5 Posterior classification 

Considering each posterior class, we compute the mean probability of belonging to each of 
the four classes in order to quantify the discriminatory ability of the model (see Web Table 
5). For each class, the probability of belonging to the allocated class is above 63% and the 
mean probability of belonging to another class is less than 10%, which is quite satisfactory. 

4.6 Post-fit trajectories 

It is of interest to estimate the typical cognitive decline of subjects who were demented or 
deceased at a given age, as well as the evolution of subjects who were alive and non-demented 
at an advanced age. Thus, we computed the mean trajectories of the ISAACS scores for a 
man with a low level of education for 4 different cases: alive and healthy at age 95 or at 85, 
dead without dementia at age 80 and a man with dementia onset at age 80. The expectation 
for the first case is given by 

G 

E(Y(t)\Tf > 95, T® > 95; § G ) = = g-§ G )P( Ci = g\T A > 95, if > 95; § G ), 

9=1 

(13) 

and the other cases are computed the same way. As expected, we observe on Figure [| that 
the trajectories of the men alive and healthy are the highest ones with the smallest declines 
(solid and dashed lines). Nevertheless, these estimates show a slight decline of the ISAACS 
scores in older ages among healthy people, probably due to a slowing of cognitive processes. 
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The decline of the man who dies at age 80 without dementia is slightly more pronounced, 
highlighting the so-called terminal decline before death, while the decline of the man who 
becomes demented at age 80 is deeper. The ‘terminal decline’ has been described by other 


authors (Wilson et ah, 2003), however, it has not been distinguished from the decline toward 


dementia. As expected, the decline is deeper in the pre-dementia phase (dotdashed curves). 


[Figure 4 about here.] 


5. Discussion 

We proposed a joint latent class illness-death model for semi-competing interval-censored 
events and longitudinal data. Joint models have previously been developed to capture the 
correlation between a longitudinal marker and competing risks but no previous model has 


accounted for interval censoring (Elashoff et al., 2008 Williamson et al., 2008 Proust 


Lima and Jacqmin-Gadda, 2015, submitted). Our simulations highlighted biased estimates 


of the joint model for standard competing risks. By dealing with interval censoring, the 
proposed method corrects for this bias and highlights different profiles of cognitive decline 
associated with different transition intensities of death and dementia. Subsequently, the mean 
trajectories of cognitive decline for people who were demented or deceased at a given age 
can be estimated. This makes it possible to distinguish the cognitive decline of the healthy 
elderly from the cognitive decline before death without dementia and the cognitive decline 
in the pre-dementia phase that appears to be the steepest. 


We chose a latent class approach to account for the heterogeneity of cognitive aging. It 
could be also interesting to develop a joint model for semi-competing interval-censored events 
and longitudinal data in a shared random effects framework. Nevertheless, with some realistic 
assumptions, handling of interval censoring is not as critical in shared random effects models 
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because the transition intensity of death may depend on the current value of the marker 
and possibly on the current slope. Thus, conditionally on the current value and the slope 
of the marker (which is the cognitive test in our application), it may be sensible to assume 
that the transition intensity of death is identical for demented and healthy subjects. In this 
case, as shown in Dantan et al. (2011), it is not necessary to distinguish possible individual 
trajectories between the last visit without dementia and the end of follow-up or death. The 
likelihood for interval-censored data is identical to the one without interval censoring, as 
long as we impute the middle of the censoring interval for subjects diagnosed as demented. 


Moreover, Leffondre et al. (2013) showed that the uncertainty regarding time to dementia 


onset, among the subjects diagnosed as demented, is not a major issue as long as the intervals 
between visits are short enough. Nevertheless, extending the joint shared random effects 
model to include an illness-death submodel instead of a standard times-to-events model 
could be useful to study the association between marker evolution and transition intensity 
of death among diseased subjects. 


A score test was previously developed (Jacqmin-Gadda et al. 2010) to assess the assump 


tion of independence between the marker and the time to an event conditionally on the latent 
classes. This score was extended to the framework of competing risks and multiple markers 


by Proust-Lima and Jacqmin-Gadda (2015, submitted). In the future, an extension to joint 


illness-death model accounting for interval censoring could be useful. 
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Class-specific transition intensities 


Figure 1. Joint latent class illness-death model: the latent classes correspond to homo¬ 
geneous subgroups of subjects with a specific marker trajectory and specific transition 
intensities to dementia and death. 
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| Inclusion 

I Last visit subject 
seen healthy 

0 Visitofdiagnosis 

Lost to follow-up 
Death 



6 A =1 6 d =1 case 1 


6 A = 16 D =o case 2 


6 A =0 6 D =1 case 3 


6 A =0 6 D =0 case 4 


6 A =0 6 D =1 case 5 


6 a =0 6 d =0 case 6 


Figure 2. Possible observation patterns for dementia and death. To obtain a more flexible 
program, we also implemented the computation of the likelihood for subjects with an exact 
date of dementia onset (L* = Ri ) although this date was never known in our dataset. 
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70 80 90 Too ° 70 80 90 100 ° 70 80 90 100 

age age age 


A) Class-specific transition intensities of the illness-death model for men with a low level of 

education. 



age age age 

B) Class-specific cumulative incidences of the illness-death model for men with a low level 

of education. 

O' 



O' •■■■■! 

70 80 90 100 

age 

C) Class-specihc ISAACS set test score trajectories for men with a low level of education. 

Figure 3. Class-specihc estimated transition intensities, cumulative incidences and mean 
longitudinal trajectories of the latent class illness-death model for each class (class 1: dashed 
line, class 2: dotted line, class 3: dotdashed line, class 4: solid line). 
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Figure 4. Predicted ISAACS trajectories for a man with a low level of education, alive 
and healthy at age 95 (solid line) or 85 (dashed line), a man who dies at age 80 without 
dementia (dotted line) and a man who becomes demented at age 80 (dotdashed line). The 
trajectory is plotted from 65 years old until the age at dementia diagnosis, loss of follow-up 
or death. 





28 


Biometrics, February 2015 


(a) Visits every 2 years 
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Table 1 

Results of the simulation study comparing estimates of the joint latent class markovian illness-death model for 
interval-censored events and the joint latent class competing risks model. A total of 500 samples of 500 subjects were 
generated with a joint markovian illness-death model with visits every 2 or f years. ASE is the asymptotic standard 
error, ESE is the empirical standard error and the coverage rate is calculated from the 95% confidence interval. 
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BIC 



Markovian 

Semi- markovian 

G=1 

106928 

107050 

G=2 

106315 

106368 

G=3 

106120 

106185 

G=4 

106058 

106113 

G=5 

106091 

106099 


Table 2 

Comparison of BIC of markovian and semi-markovian joint latent class illness-death models, with a total number of 

classes varying from 1 to 5 (Paquid, N=3,525). 



